OCR exploration server

Warning: this site is under development.
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

A Statistical Method for an Automatic Detection of Form Types

Internal identifier: 002019 (Main/Exploration); previous: 002018; next: 002020

Authors: Saddok Kebairi [France]; Bruno Taconet [France]; Abderrazak Zahour [France]; Said Ramdane [France]

RBID : ISTEX:8B5FF991DBC62B054AA1AC2D82E422CCE847DBF9

Abstract

In this paper, we present a statistical method for classifying forms whose physical structure may vary from one writer to another. An automatic form segmentation extracts the physical structure, which is described by the set of main rectangular blocks. During the learning phase, blocks are matched within each class; the number of occurrences of each block is counted, and statistical block attributes are computed. During the identification phase, block instability is handled by introducing a block penalty coefficient, which modifies the classical expression of the Mahalanobis distance. The block penalty coefficient depends on the block occurrence probability. Experimental results on different form types are given.
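The abstract does not give the modified distance itself, so the following is only a minimal sketch of the general idea, under loudly stated assumptions: blocks are already matched and keyed by a name, each block is described by a few numeric attributes with a diagonal covariance, and the occurrence probability is used both as a weight on the Mahalanobis term and as the cost of a missing block. The names `BlockModel`, `penalized_distance`, `classify`, and the `missing_cost` parameter are hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass
from typing import Dict, Sequence


@dataclass
class BlockModel:
    """Learned statistics for one block of a form class (hypothetical layout)."""
    mean: Sequence[float]      # mean of the block attributes, e.g. position and size
    variance: Sequence[float]  # per-attribute variance (diagonal covariance assumed)
    occurrence: float          # fraction of training forms in which the block appeared


def mahalanobis_sq(x, mean, variance):
    """Squared Mahalanobis distance for a diagonal covariance matrix."""
    return sum((xi - mi) ** 2 / vi for xi, mi, vi in zip(x, mean, variance))


def penalized_distance(observed: Dict[str, Sequence[float]],
                       model: Dict[str, BlockModel],
                       missing_cost: float = 10.0) -> float:
    """Distance between an observed form and a class model.

    Assumption (not the paper's formula): each block's contribution is
    weighted by its occurrence probability, so unstable blocks matter less,
    and a missing block costs occurrence * missing_cost.
    """
    total = 0.0
    for name, block in model.items():
        x = observed.get(name)
        if x is None:
            # Missing a block that is almost always present is penalized more.
            total += block.occurrence * missing_cost
        else:
            total += block.occurrence * mahalanobis_sq(x, block.mean, block.variance)
    return total


def classify(observed: Dict[str, Sequence[float]],
             classes: Dict[str, Dict[str, BlockModel]]) -> str:
    """Return the label of the class model with the smallest penalized distance."""
    return min(classes, key=lambda label: penalized_distance(observed, classes[label]))
```

In this sketch, a form is passed as a dictionary from block name to attribute vector, and `classify` compares it against every learned class model. A full covariance matrix, a real block matching step, and the authors' actual penalty expression would replace the simplifications made here.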

DOI: 10.1007/3-540-48172-9_8



The document in XML format

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">A Statistical Method for an Automatic Detection of Form Types</title>
<author>
<name sortKey="Kebairi, Saddok" sort="Kebairi, Saddok" uniqKey="Kebairi S" first="Saddok" last="Kebairi">Saddok Kebairi</name>
</author>
<author>
<name sortKey="Taconet, Bruno" sort="Taconet, Bruno" uniqKey="Taconet B" first="Bruno" last="Taconet">Bruno Taconet</name>
</author>
<author>
<name sortKey="Zahour, Abderrazak" sort="Zahour, Abderrazak" uniqKey="Zahour A" first="Abderrazak" last="Zahour">Abderrazak Zahour</name>
</author>
<author>
<name sortKey="Ramdane, Said" sort="Ramdane, Said" uniqKey="Ramdane S" first="Said" last="Ramdane">Said Ramdane</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:8B5FF991DBC62B054AA1AC2D82E422CCE847DBF9</idno>
<date when="1999" year="1999">1999</date>
<idno type="doi">10.1007/3-540-48172-9_8</idno>
<idno type="url">https://api.istex.fr/document/8B5FF991DBC62B054AA1AC2D82E422CCE847DBF9/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">002688</idno>
<idno type="wicri:Area/Istex/Curation">002509</idno>
<idno type="wicri:Area/Istex/Checkpoint">001570</idno>
<idno type="wicri:doubleKey">0302-9743:1999:Kebairi S:a:statistical:method</idno>
<idno type="wicri:Area/Main/Merge">002128</idno>
<idno type="wicri:Area/Main/Curation">002019</idno>
<idno type="wicri:Area/Main/Exploration">002019</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">A Statistical Method for an Automatic Detection of Form Types</title>
<author>
<name sortKey="Kebairi, Saddok" sort="Kebairi, Saddok" uniqKey="Kebairi S" first="Saddok" last="Kebairi">Saddok Kebairi</name>
<affiliation wicri:level="3">
<country xml:lang="fr">France</country>
<wicri:regionArea>Laboratoire d’Informatique du Havre, Université du Havre, Place Robert Schuman, 76610, Le Havre</wicri:regionArea>
<placeName>
<region type="region" nuts="2">Région Normandie</region>
<region type="old region" nuts="2">Haute-Normandie</region>
<settlement type="city">Le Havre</settlement>
</placeName>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">France</country>
</affiliation>
</author>
<author>
<name sortKey="Taconet, Bruno" sort="Taconet, Bruno" uniqKey="Taconet B" first="Bruno" last="Taconet">Bruno Taconet</name>
<affiliation wicri:level="3">
<country xml:lang="fr">France</country>
<wicri:regionArea>Laboratoire d’Informatique du Havre, Université du Havre, Place Robert Schuman, 76610, Le Havre</wicri:regionArea>
<placeName>
<region type="region" nuts="2">Région Normandie</region>
<region type="old region" nuts="2">Haute-Normandie</region>
<settlement type="city">Le Havre</settlement>
</placeName>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">France</country>
</affiliation>
</author>
<author>
<name sortKey="Zahour, Abderrazak" sort="Zahour, Abderrazak" uniqKey="Zahour A" first="Abderrazak" last="Zahour">Abderrazak Zahour</name>
<affiliation wicri:level="3">
<country xml:lang="fr">France</country>
<wicri:regionArea>Laboratoire d’Informatique du Havre, Université du Havre, Place Robert Schuman, 76610, Le Havre</wicri:regionArea>
<placeName>
<region type="region" nuts="2">Région Normandie</region>
<region type="old region" nuts="2">Haute-Normandie</region>
<settlement type="city">Le Havre</settlement>
</placeName>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">France</country>
</affiliation>
</author>
<author>
<name sortKey="Ramdane, Said" sort="Ramdane, Said" uniqKey="Ramdane S" first="Said" last="Ramdane">Said Ramdane</name>
<affiliation wicri:level="3">
<country xml:lang="fr">France</country>
<wicri:regionArea>Laboratoire d’Informatique du Havre, Université du Havre, Place Robert Schuman, 76610, Le Havre</wicri:regionArea>
<placeName>
<region type="region" nuts="2">Région Normandie</region>
<region type="old region" nuts="2">Haute-Normandie</region>
<settlement type="city">Le Havre</settlement>
</placeName>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">France</country>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="s">Lecture Notes in Computer Science</title>
<imprint>
<date>1999</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">8B5FF991DBC62B054AA1AC2D82E422CCE847DBF9</idno>
<idno type="DOI">10.1007/3-540-48172-9_8</idno>
<idno type="ChapterID">8</idno>
<idno type="ChapterID">Chap8</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Abstract: In this paper, we present a method to classify forms by a statistical approach; the physical structure may vary from one writer to another. An automatic form segmentation is performed to extract the physical structure which is described by the main rectangular block set. During the form learning phase, a block matching is made inside each class; the number of occurrences of each block is counted, and statistical block attributes are computed. During the phase of identification, we solve the block instability by introducing a block penalty coefficient, which modifies the classical expression of Mahalanobis distance. A block penalty coefficient depends on the block occurrence probability. Experimental results, using the different form types, are given.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>France</li>
</country>
<region>
<li>Haute-Normandie</li>
<li>Région Normandie</li>
</region>
<settlement>
<li>Le Havre</li>
</settlement>
</list>
<tree>
<country name="France">
<region name="Région Normandie">
<name sortKey="Kebairi, Saddok" sort="Kebairi, Saddok" uniqKey="Kebairi S" first="Saddok" last="Kebairi">Saddok Kebairi</name>
</region>
<name sortKey="Kebairi, Saddok" sort="Kebairi, Saddok" uniqKey="Kebairi S" first="Saddok" last="Kebairi">Saddok Kebairi</name>
<name sortKey="Ramdane, Said" sort="Ramdane, Said" uniqKey="Ramdane S" first="Said" last="Ramdane">Said Ramdane</name>
<name sortKey="Ramdane, Said" sort="Ramdane, Said" uniqKey="Ramdane S" first="Said" last="Ramdane">Said Ramdane</name>
<name sortKey="Taconet, Bruno" sort="Taconet, Bruno" uniqKey="Taconet B" first="Bruno" last="Taconet">Bruno Taconet</name>
<name sortKey="Taconet, Bruno" sort="Taconet, Bruno" uniqKey="Taconet B" first="Bruno" last="Taconet">Bruno Taconet</name>
<name sortKey="Zahour, Abderrazak" sort="Zahour, Abderrazak" uniqKey="Zahour A" first="Abderrazak" last="Zahour">Abderrazak Zahour</name>
<name sortKey="Zahour, Abderrazak" sort="Zahour, Abderrazak" uniqKey="Zahour A" first="Abderrazak" last="Zahour">Abderrazak Zahour</name>
</country>
</tree>
</affiliations>
</record>

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 002019 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 002019 | SxmlIndent | more

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     ISTEX:8B5FF991DBC62B054AA1AC2D82E422CCE847DBF9
   |texte=   A Statistical Method for an Automatic Detection of Form Types
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024